Cosine Similarity as Machine Reading Technique

نویسندگان

  • Gaurav Arora
  • Prasenjit Majumder
چکیده

Question answering for Machine reading evaluation track is a aim to check machine understanding ability of a machine.As we analyzed most crusial part for efficient working of this system is to select text which needs to be considered for understanding since understanding text would involve a lot of NLP processing. This paper covers our submitted system for QA4MRE campaign, Which mostly focuses on two part first being selecting text from comprehension and background knowledge needed to be understand and second being eliminating or ranking options based on selected text from former step.Our main focus was on eliminating and ranking which boils down to tunning various parameter for selection whether to answer particular question if answered how to consider scores,Following methods like calculating cosine between question and passage sentences,cosine of named entities output of passage sentences and question were also considered for scoring .In addition to this basic frame work of our system negation of sentences were also considered to answers which received very close score.We also considered expansion of question and options respectively to collect relevant information from background collection.Entity Co-referencing and normalization were some of important preprocessing to consider on passage and background collection as we analyzed since it can increase score of sentence or option which do not directly mention entity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using an Information Retrieval Technique to Discover Malicious Software

This paper describes a research effort to detect unknown, known or variances of known malicious software using an information retrieval technique known as cosine similarity. Document similarity techniques, such as cosine similarity, have been used with great success in several document retrieval applications. By following the standard information retrieval methodology, software, in machine read...

متن کامل

SOME SIMILARITY MEASURES FOR PICTURE FUZZY SETS AND THEIR APPLICATIONS

In this work, we shall present some novel process to measure the similarity between picture fuzzy sets. Firstly, we adopt the concept of intuitionistic fuzzy sets, interval-valued intuitionistic fuzzy sets and picture fuzzy sets. Secondly, we develop some similarity measures between picture fuzzy sets, such as, cosine similarity measure, weighted cosine similarity measure, set-theoretic similar...

متن کامل

Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model

We show how to consider similarity between features for calculation of similarity of objects in the Vec­ tor Space Model (VSM) for machine learning algorithms and other classes of methods that involve similarity be­ tween objects. Unlike LSA, we assume that similarity between features is known (say, from a synonym dictio­ nary) and does not need to be learned from the data. We call the proposed...

متن کامل

Dimension independent similarity computation

We present a suite of algorithms for Dimension Independent Similarity Computation (DISCO) to compute all pairwise similarities between very high-dimensional sparse vectors. All of our results are provably independent of dimension, meaning that apart from the initial cost of trivially reading in the data, all subsequent operations are independent of the dimension; thus the dimension can be very ...

متن کامل

Retrieval Structure Construction During Reading: Experimentation and Simulation

The aim of this study was to investigate the construction of a retrieval structure during reading, according to the hypothesis that text macrostructure is used in Long-term working memory (Ericsson & Kintsch, 1995) to maintain encoded information in an accessible format. We first designed an experiment for testing the hypothesis that retrieval structure is a macrostructure of the text. Then, we...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011